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EDGEWORTH  EXPANSIONS  IN  STATISTICS:  A  BRIEF  REVIEW* 


by 

Shanti  S.  Gupta  d  S.  Panchapakesan 

Purdue  University  Southern  Illinois  University 

1.  Introduction.  The  study  of  approximations  to  distributions  has  been 
an  important  part  of  statistical  investigations  since  the  early  part  of  this 
century.  Chari ier,  Chebyshev,  and  Edgeworth  were  notable  contributors  among 
others.  Though  the  interest  initially  was  the  approximation  to  empirical 
distributions  by  theoretical  functions,  the  focus  quickly  shifted  to  approximate 
evaluation  of  distribution  functions  or  quantiles  of  complicated  distributions. 
Such  approximations  are  of  vital  importance  in  statistical  inference,  especially 
in  the  investigation  of  robustness  of  standard  tests  of  hypotheses  and  of 
estimators. 

Consider  a  sequence  of  statistics,  {TN> ,  N  >_  1 ,  where  N  usually  denotes 
the  sample  size.  The  distribution  function  (d.f.)  of  T^  is  said  to  possess 
an  asymptotic  expansion  val id  to  (r+1)  terms  if  functions  Ag,...,Ar  can  be 
found  such  that 


(1.1) 


r  A .  (x) i 

fn(x)  ■  Vx) "  i,  d/H 


j=l  NJ 


o(N_r/2). 


The  expansion  is  said  to  be  uniformly  valid  to  (r+1 )  terms  if 


(1.2) 


sup 

X 


r  A.(x) 


fnm  -  *0(x)  -  jri 


=  o(N'r/2). 


It  is  to  be  noted  that,  in  defining  the  above  concepts,  Wallace  (1958) 


/ 


-  ( r+1 ) J 2  ‘ 

requires  the  remainder  to  be  0(N  '  "  ).  Our  definition  is  in  accord  with/ 
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Erdelyi  (1956)  and  is  used  by  Bickel  (1974)  and  many  other  authors.  An 
expansion  which  is  valid  to  just  one  term  gives  an  ordinary  limit  theorem. 

The  higher  order  terms  are  of  practical  interest.  The  basic  approximation 
Ag  can  often  be  greatly  improved  by  taking  one  or  two  additional  terms  of 
the  expansion.  Further,  these  expansions  are  useful  and  necessary  to 
discriminate  procedures  equivalent  to  first  order.  This  point  has  been  clearly 
brought  out  by  Hodges  and  Lehmann  (1970). 

Expansions  of  the  type  (1.1)  are,  in  general ,  known  as  Edgeworth 
expansions.  The  related  expansions  of  Fj^  are  the  Cornish-Fisher  expansions. 

Of  course,  there  are  some  other  expansions  of  interest  such  as  the  expansions 
for  density  functions  and  frequency  functions  of  lattice  random  variables.  In 
some  cases,  better  approximations  can  be  obtained  based  on  saddle  point  method; 
however,  this  involves  a  deep  knowledge  of  the  characteristic  function  of  F^ 
which  may  not  be  usually  available.  Our  interest  here  is  mainly  confined  to 
Edgeworth  and  Cornish-Fisher  expansions  in  which  AQ(x)  =  4> ( x ) ,  the  standard 
normal  distribution  function  (as,  in  fact,  the  case  was,  when  they  were  first 
introduced).  It  is  perhaps  appropriate  at  this  point  to  make  a  comment  about 
the  Edgeworth  series  and  the  related  Chari ier's  A-series.  Charlier's  paper 
was  published  in  1905  and  Edgeworth's  in  1907.  However,  Gnedenko  and 
Kolmogorov  (1968,  Chapter  8)  point  out  that  both  these  types  of  expansions 
appear  already  in  the  work  of  Chebyshev.  For  further  historical  information 
about  these  expansions,  we  refer  to  Cramer  (1972)  and  Sarndal  (1971). 

Another  problem  that  has  always  been  of  interest  is  to  obtain  suitable 
bounds  for  sup|F^(x)  -  Ag(x)|  or  in  other  words,  to  determine  the  rate  of 
convergence  to  Ag.  These  bounds  are  called  Berry-Esseen  bounds.  Tor  basic 
contributions  to  this  problem,  one  should  refer  to  Berry  (1941),  Esseen 
(1942,  1945),  and  Bergstrom  (1944,  1945,  1949).  In  the  course  of  this 


paper,  we  will  refer  to  some  recent  results  regarding  these  bounds. 


Significant  contributions  to  the  theory  of  asymptotic  expansions  were 
made  by  Rao  (1960)  who  obtained  Edgeworth  expansions  and  Berry-Esseen  bounds 
for  sums  of  independent  random  vectors.  These  and  other  related  developments 
in  the  field  including  the  results  of  B.  von  Bahr,  R.  N.  Bhattacharya  and 
A.  Bikjalis  are  discussed  in  Bhattacharya  and  Rao  (1976).  Two  important  survey 
papers  are  those  of  Wallace  (1958)  and  Bickel  (1974).  The  best  textbook 
references  for  the  basic  introduction  to  Edgeworth  series  are  Cramer 
(1951)  and  Kendall  and  Stuart,  Vol .  I  (1977).  For  excellent  accounts  of  the 
theory  of  Edgeworth  expansions  for  sums  of  independent  random  variables,  one 
should  also  refer  to  Cramer  (1962),  Gnedenko  and  Kolmogorov  (1968),  Feller 
(1971)  and  Petrov  (1975). 

In  reviewing  the  development  of  Edgeworth  expansions  in  statistics,  we 
do  not  attempt  to  be  exhaustive  -  neither  in  coverage  nor  in  details.  We 
restrict  our  attention  to  some  of  the  recent  developments.  These  relate  to 
transformations  of  Edgeworth  series  (Section  2),  Cornish-Fisher  expansions 
(Section  3),  Berry-Esseen  bounds  and  Edgeworth  expansions  for  test-statistics 
such  as  linear  rank  statistics,  U-statisti cs ,  and  linear  combinations  of 
order  statistics  (Section  4),  expansions  for  minimum  contrast  estimators  and 
Fisher-consistent  estimators  (Section  5),  Edgeworth  and  Cornish-Fisher 
expansions  in  selection  and  ranking  problems  (Section  6),  and  asymptotic 
expansions  for  statistics  with  nonnormal  limiting  distributions  (Section  7). 

2.  Edgeworth  Series  for  Sums  and  Tranformations  of  Series.  Let 
(Xn}’  n  1 ,  be  a  sequence  of  independent  and  identically  distributed  (i.i.d.) 
random  variables  and  let  Fr  denote  the  d.f.  of  the  standardized  sum 

n 

Yn  =  where  v  =  E(X])  and  o2  =  V(X1 ) .  Now,  letting  the 
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p 

cumulants  Kr  =  o  Ar>  r  >_  3,  the  first  few  terms  of  the  Edgeworth  expansion 
are  given  by 


a*),x)  ,  rx4$(4)(x)  a^(6)(x)1 

F"w  =  *(x)  ■  ‘3  4  ir-w- +  -Hr—  J  4 . 

where  <}>'  '(x)  denotes  the  rth  derivative  of  <t>(x)  with  respect  to  (w.r.t.) 
x.  Cramer  (1928)  proved  that  the  series  is  valid  uniformly  in  x  provided 
that  one  more  cumulant  exists  than  used  in  any  partial  sum  and  the  character¬ 
istic  function  ^ ( t )  of  satisfies  the  condition 


(2.1)  lim  sup |  ip(t)  |  <  1. 

1 1 1  -*  <*> 

This  condition  (commonly  now  called  the  Cramer's  condition)  is  satisfied  if 
the  distribution  of  has  an  absolutely  continuous  part.  For  discrete 
distributions,  the  condition  is  not  satisfied  and  in  this  case  the  result 
is  not  generally  valid;  however,  a  different  expansion  is  available.  Takeuchi 
and  Akahira  (1977)  have  given  Edgeworth  expansion  of  when  moments  do  not 
necessarily  exist  but  when  the  density  can  be  approximated  by  rational 
functions. 

The  theory  of  Edgeworth  expansions  for  sums  of  independent  random 
variables  is  fairly  wel 1 -developed  and  for  excellent  accounts  one  can 
refer  to  any  of  the  books  mentioned  in  this  regard  in  the  previous  section. 

Suppose  we  have  a  sequence  of  distributions  of  random  vectors  approximated 
to  a  certain  order  of  accuracy  by  an  Edgeworth  series.  Bhattacharya  and  Ghosh 
(1978)  proved  that  such  an  expansion  may  be  transformed  by  a  sequence  of  smooth 
functions  of  the  corresponding  random  vectors  to  yield  a  valid  Edgeworth 
expansion  of  the  resulting  sequence  of  distributions.  This  latter  expansion 
may  be  obtained  by  the  so-called  delta  method  in  which  the  moments  of  a  function 
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of  a  random  vector  are  formally  calculated  from  a  Taylor  series  expansion  of 
the  function.  The  conditions  needed  to  be  satisfied  are  essentially  concerned 
with  the  derivatives  of  the  functions.  Skovgaard  (1981a)  gives  a  generalization 
of  the  Bhattacharya-Ghosh  theorem.  These  results  provide  the  mechanism  for 
Edgeworth  expansions  of  distributions  of  statistics  in  non-standard  cases. 
Skovgaard  (1981b)  has  used  this  approach  to  obtain  Edgeworth  expansion  of  the 
distribution  of  maximum  likelihood  estimators  in  the  general  (non  i.i.d.)  case. 


3.  Cornish-Fisher  Expansions.  Typically,  in  many  statistical  applications, 
we  require  quantiles  of  a  distribution.  Starting  with  an  Edgeworth  expansion 
of  a  distribution  function  F  ,  Cornish  and  Fisher  (1937)  obtained  an  asymptotic 
expansion  of  a  quantile  x  of  Fn  in  terms  of  the  corresponding  normal  quantile  z 
by  means  of  formal  substitutions,  Taylor  expansion  and  identification  of  powers 
of  n.  The  expansion  is  of  the  form 


(3.1) 


P,(z)  P?(z)  P.(z)  P,(z) 

X  =  Z  +  — -  +  — -  +  ^ 


/n 


n 


n3'2  n2 


where  the  P^z)  are  polynomials  in  z.  We  refer  to  P^(z)  as  the  ith  adjustment. 

Cornish  and  Fisher  (1937)  have  tabled,  for  nine  probability  levels, 

all  the  polynomials  needed  to  obtain  all  terms  through  the  fourth  adjustment 

based  on  the  sixth  cumulant.  Fisher  and  Cornish  (1960)  extended  the  formulae  and 

tables  to  the  sixth  adjustment  based  on  the  eighth  cumulant  for  ten  (the  earlier 

nine  and  one  more)  probability  levels.  These  are  further  extended  to 

order  n’4  by  Draper  and  Tierney  (1973)  and  to  order  n~®  by  Hill  (1964). 

Now,  we  can  also  expand  z  in  terms  of  x  in  a  form  similar  to  (3.1).  This 
is  known  as  the  normalizing  expansion.  In  fact,  this  is  obtained  as  an 


intermediate  step  in  getting  (3.1).  It  is  also  useful  in  itself  as  it  provides 
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an  asymptotic  transformation  of  a  random  variable  X  with  distribution  Fn 
into  a  standard  normal  random  variable.  Both  expansions  are  referred  to 
as  Cornish-Fisher  expansions.  For  an  absolutely  continuous  distribution 
both  expansions  are  valid  for  every  probabi 1 i ty  level  whenever  the  initial 
Edgeworth  series  is  valid.  This  can  be  proved  by  adapting  the  proof  of  Wasow 
(1956)  for  the  invertabil ity  of  a  special  class  of  distribution  expansions. 
Incidentally,  Hill  and  Davis  (1968)  obtained  formal  expansions  which  generalize 
Cornish-Fisher  relations  to  an  arbitrary  analytic 

The  Edgeworth  and  Cornish-Fisher  expansions  require  the  knowledge  ol 
the  cumulants  of  the  distributions  involved.  Gray,  Coberly  and  Lewis  (1975) 
showed  how  the  general  Edgeworth  expansion  can  be  suitably  utilized  to 
eliminate  the  requirement  of  knowing  the  cumulants  without  affecting  the  order 
of  the  error  of  approximation.  McCune  and  Gray  (1975)  used  this  result  along 
with  the  expansions  of  Hill  and  Davis  (1968)  to  obtain  Cornish-Fisher  type 
expansions  with  unknown  cumulants.  However,  these  new  expansions  lack  simplicity 
for  applications.  Using  a  technique  analogous  to  that  used  by  Gray,  Coberly 
and  Lewis  (1975),  a  simpler  expression  was  obtained  by  McCune  (1977). 

Finally,  it  should  be  noted  that  the  Edgeworth  and  Cornish-Fisher  expansions 
are  known  to  have  some  deficiencies  which  show  up  in  tails  of  the  distribution. 

The  approximations  for  Fn(x)  are  not  probability  distributions.  Further,  the 
monotonicity  property  as  well  as  the  zero-one  ramie  property  are  violated  in 
parts  of  either  or  both  tails.  Correspondingly,  the  approximations  for 
quantiles  are  not  always  monotonic  in  the  probability  level. 

4.  Berry-Esseen  Bounds  and  Edgeworth  Expansions  for  Certain  Test  Statistics. 


As  we  have  already  mentioned,  a  reasonably  complete  theory  of  Berry-Esseen  bounds 
and  Edgeworth  expansions  is  available  for  sums  of  independent  random  variables 
and  vectors.  In  recent  years,  Berry-Esseen  bounds  and  asymptotic  expansions 
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have  been  obtained  for  several  statistics  occurring  in  statistical  estimation 
and  tests  which  are  of  a  different  structure.  In  this  section,  we  discuss  these 
results  tor  linear  combination  of  order  statistics,  spacings,  simple  linear 
rank  statistics  and  U-statistics. 

In  obtaining  these  results  for  statistics  in  these  situations,  one 
method  is  to  obtain  a  stochastic  expansion  for  T^;  in  other  words,  is 
approximated  sufficiently  accurately  by  a  statistic  which  has  a  simpler 
structure.  The  desired  results  are  then  proved  for  instead  of  T^.  In 
many  cases  that  were  first  considered,  is  a  smooth  function  of  a  sum  of 
independent  random  vectors  and  the  problem  is  solved  by  the  use  of  the 
classical  theory.  In  some  other  cases,  is  of  a  different  type.  An 

important  example  of  such  is  a  U-statistic.  This  situation  arises,  for 
example,  when  we  are  dealing  with  one-sample  linear  rank  statistic  and  linear 
combinations  of  order  statistics. 

Another  technique  used  by  Albers,  Bickel  and  van  Zwet  (1976)  and  Bickel 
and  van  Zwet  (1978)  to  obtain  Edgeworth  expansions  for  the  one-  and  two-sample 
problems  is  based  on  conditioning.  With  the  right  conditioning,  it  turns  out 
easier  to  obtain  an  Edgeworth  expansion  for  the  conditional  d.f.  of  the  linear 
rank  statistic  T^.  One  can  then  obtain  an  expansion  for  the  unconditional 
d.f.  of  T^  by  taking  the  expected  value. 

Albers,  Bickel  and  van  Zwet  (1976)  gave  a  rigorous  proof  of  Edgeworth 
expansion  for  the  d.f.  of  the  general  linear  rank  statistic  for  the  one-sample 
problem  under  the  null  hypothesis  as  well  as  general  alternatives.  They  have 
also  shown  that  the  expansion  can  be  greatly  simplified  by  considering  contiguous 
location  alternatives  and  smooth  scores.  Albers  (1979)  specialized  the  general 
results  to  certain  contiguous  nonparametric  alternatives  and  smooth  scores. 


•-  ...  _ 
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Bickel  and  van  Zwet  (1978)  provided  the  Edgeworth  expansion  for  the  general 
two-sample  linear  rank  statistic  under  the  null  hypothesis  as  well  as 
contiguous  location  alternatives.  The  expansion  under  the  null  hypothesis 
has  been  obtained  also  by  Robinson  (1978).  Numerical  aspects  of  these 
expansions  are  considered  in  Albers  (1974).  Rogers  (1971)  obtained  an 
expansion  under  the  null  hypothesis  in  the  special  case  of  the  two-sample 
Wilcoxon  statistic  but  his  proof  appears  to  be  in  error.  A  Berry-Esseen 
bound  in  this  special  case  was  obtained  by  Stoker  (1954). 

Suppose  that  X^.X^,...^  are  independent  random  variables  with  density 
functions  f^ .fg,. . . respectively.  Let  Rj  be  the  rank  ot  X.  when  the  Xi 
are  arranged  in  increasing  order.  For  sequences  of  real  numbers  c-|,...,c., 
and  a,,...,a^,  we  define  a  simple  linear  rank  statistic  by 
N 


=  l  CjaR  .  Here  the  ai  are  the  scores.  By  taking  c-|  =...=  cffl  =  0, 

cm+1  =...=  cN  =  1  for  1  <  m  <  N,  we  get  the  two-sample  rank  statistic.  The 
behavior  of  the  characteristic  function  of  a  suitably  standardized  T^  for 
the  large  values  of  the  argument  has  been  investigated  by  van  Zwet  (1980). 
Using  this  result,  an  Edgeworth  expansion  with  remainder  o(N~^)  is  obtained 
by  Does  (1981)  for  simple  linear  rank  statistics  under  the  null  hypothesis. 
His  theorem  holds  for  a  wide  class  of  scores  generating  functions  which  in¬ 
cludes  the  normal  quantile  function. 

Suppose  that  X-j  ^  ±  ^  <_. .  .<  X^  ^  are  order  statistics  corresponding 

to  X.|  •  • » X^  which  are  i.i.d.  with  distribution  F.  Let 


tn  =  n 


-1 


1 1 

l  c.  MX.  N,  where  the  c.  N  are  known  real  numbers  (weights). 

j  =  ]  J  i  ^  J  ,  IN 


Statistics  of  the  form  T.,  are  linear  combinations  of  order  statistics,  in 

N 

this  case,  Berry-Esseen  bounds  are  given  by  Bjerve  (1977)  and  Helmers 
(1977,  1981a, b).  Edgeworth  expansion  for  T^  with  smooth  weight  functions  is 
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obtained  by  Helmers  (1980)  under  the  assumption  that  the  underlying  distribu¬ 
tion  l mut ion  possesses  a  finite  fourth  moment  and  a  certain  local  smoothness 
property.  His  proof  uses  Theorem  4.1  of  van  Zwet  (1977)  which  provides  a 
bound  for  the  characteristic  function  of  T^.  For  the  special  case  in  which 
the  underlying  distribution  is  uniform,  the  above  results  of  Bjerve,  Helmers 
and  van  Zwet  are  substantially  weaker  than  the  result  of  van  Zwet  (1979). 

The  results  of  Helmers  (1980)  for  linear  combination  of  order  statistics 
do  not  include  trimmed  means.  However,  Bjerve  (1974)  has  shown  that  trimmed 
means  admit  asymptotic  expansion.  His  proof  employs  a  special  property  of 
trimmed  means  and  thus  does  not  apply  to  more  general  trimmed  linear  combina¬ 
tions.  Expansion  in  the  general  case  has  been  obtained  by  Helmers  (1979). 

These  and  related  results  are  summarized  in  Helmers  (1981b).  A  brief  but 
clear  account  of  the  results  of  Bjerve  (1977)  and  Helmers  (1977)  is  given 
by  van  Zwet  (1977). 

Suppose  that  U^U^,...  is  a  sequence  of  i.i.d.  random  variables  which 
are  uniformly  distributed  on  (0,1).  For  N  =  1,2,...,  let 
U1  N  "  U2  n  -•••-  UN  N  be  tbe  order  statistics.  Let  UQ  ^  =  0  and  UN+^  N  =  1 . 
The  spacings  are  defined  by  D.^  =  U^  1  N’  ^  =  ^>•••’^+1.  Let 

g^:  (0,»')  ->  IR  ,  N  >  1 ,  be  a  sequence  of  measurable  functions  and  define 

N+l 

Tn  =  V  gN((N+l)D.N),  Ml.  Does  and  Helmers  (1980)  have  obtained 

Edgeworth  expansion  for  T^.  They  have  also  established  a  Berry-Esseen 
bound  of  order  0(N_i:)  for  normalized  T^  but  under  conditions  that  are  hard 
to  check.  The  same  bound  is  obtained  by  Does  and  Klaassen  (1981)  under 
conditions  which  are  easier  to  check. 

Another  important  class  of  statistics  are  known  as  U-stati sti cs .  In 
finding  a  T^  which  stochastically  approximates  T^  at  hand,  a  possible  situation 
is  that  T^  is  a  U-statistic  of  order  k  (1  j<  k  N).  In  other  words. 
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T 


N 


V  h(X.  ,...,X.  ), 

ik  <  N  1 1  \ 


where  X^,...,X^  are  i.i.d.  random  variables.  One-sample  linear  rank  statistics 
and  linear  combinations  of  order  statistics  are  known  examples  of  such  T^. 
Berry-Esseen  type  results  establishing  convergence  to  normality  of  the  d.f. 
of  a  standardized  at  the  rate  of  N’a  have  been  given  by  Bickel  (1974), 

Chan  and  Wierman  (1977)  and  Callaert  and  Janssen  (1978).  These  authors 
discuss  U-statistics  of  order  2;  however,  as  pointed  out  by  these  authors, 
all  their  results  hold  for  U-statistics  of  any  fixed  order  and  also  hold  in 
the  multi-sample  case  [see  Janssen  (1978)]. 

To  describe  the  results  for  U-statistics  of  order  2,  define 


UN  = 


<  l 


h(Xi,Xj), 


where  the  kernel  h  is  a  symmetric  function  of  two  variables  with 

2 

Eh  ( X-j ,  X2 )  =  0.  Let  denote  the  variance  of  U^.  Asymptotic  normality  of 

-1  2 
oN  UN  was  proved  by  Hoeffding  (1948)  under  the  sole  condition  that  Eh  (X^X^) 

exists.  Grams  and  Serfling  (1973)  showed  that  supjP[o^U^  <  x]  -  v(x)j  = 

x 

0(M_r/(2r+1 ) )  when  Eh2r  <  this  gives  0(N~2+e)  when  h  has  finite  moments 
of  all  orders.  An  order  bound  of  exactly  0(N  * )  was  obtained  by  Bickel  (1974) 
assuming  that  the  kernel  h  is  bounded.  The  same  order  bound  was  obtained  by 
Chan  and  Wierman  (1977)  assuming  only  the  existence  of  the  fourth  moment  of  h 
and  by  Callaert  and  Janssen  (1978)  under  the  assumption  that  the  third  absolute 
moment  of  the  kernel  exists.  Recently,  Helmers  and  van  Zwet  (1982)  relaxed  this 
moment  condition  even  further  by  requiring  that  E | h ( X^ , X^ ) | P  <  00  for  some 
p  >  5/3.  A  Berry-Esseen  theorem  for  U-statistics  when  the  sample  size  is 
random  is  given  by  Ahmad  (1980)  when  the  random  size  is  independent  of  the 
observations.  A  special  case  of  Ahmad's  result  is  a  Berry-Esseen  theorem 


-  •'  -<"v 
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for  random  sums  which  is  also  discussed  by  Landers  and  Roqqe  (1976);  however, 
tin*  latter  authors  do  not  assume  the  independence  between  the  sample  observations 
and  the  sample  size  but  assume  more  stringent  conditions  on  { Nn ) .  the  sequence  of 
positive  integer-valued  random  variables  whose  values  are  the  sample  sizes.  An 
Edqeworth  expansion  for  the  d.f.  of  with  remainder  o(N~^)  has  been  obtained 

by  Callaert,  Janssen  and  Veraverbeke  (1979)  assuming  the  existence  of  the  fifth 
absolute  moment  of  h  and  some  regularity  conditions  on  the  kernel.  The  same 
result  is  established  by  these  authors  in  another  paper  (1980)  under  a  set  of 
less  restrictive  but  less  tractable  set  of  conditions. 

Finally,  for  the  class  of  Neyman's  C(a)  tests,  Chibisov  (1972a)  obtained 

1 

the  term  of  order  N%  in  the  normal  approximation  for  the  distributions  of  the 
test  statistics.  In  another  paper,  he  (1972b)  derived  asymptotic  expansions 
for  the  distributions  of  some  test  criteria  for  testing  compound  hypotheses. 

5.  Asymptotic  Expansions  for  Estimators.  In  this  section,  we  discuss 
asymptotic  expansions  for  a  number  of  statistics  occurring  in  parametric 
models  such  as  maximum  likelihood  (m.C.)  estimators  and,  more  generally, 
minimum  contrast  (m.c.)  estimators.  Of  course,  there  could  be  tests  based 
on  these  estimators  and  thus  could  have  been  included  in  the  previous  section. 

But  they  are  considered  here  to  emphasize  their  roles  in  estimation. 

Linnik  and  Mitrofanova  (1965)  obtained  asymptotic  expansions  for  the 
distributions  of  m.f.  estimators.  Mitrofanova  (1967)  extended  the  result 
for  vector  parameters.  As  pointed  out  by  Pfanzagl  (1973a,  p.  998)  and 
Bickel  (1974,  p.  11),  her  proof  contains  serious  gaps.  Chibisov  (1973  a,b) 
extended  the  work  of  Linnik  and  Mitrofanova  (1965)  to  a  wider  class  of  m.c. 
estimators  for  a  single  parameter  under  progressively  much  weaker  conditions. 
Pfanzagl  (1973a)  also  deals  with  Edgeworth  expansions  for  m.c.  estimators  for 
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a  single  parameter.  Earlier,  Michel  and  Pfanzagl  (1971)  and  Pfanzagl  (1971) 
established  the  order  of  the  error  of  the  normal  approximation  for  m.c. 
estimators.  In  another  paper,  Pfanzagl  (1973b)  showed  that,  for  m.c.  estimators 
of  vector  parameters,  the  approximation  by  normal  distribution  holds  with 
an  error  of  order  o(N~2)  uniformly  over  the  class  of  all  convex  sets.  For  a 
discussion  of  the  results  of  Linnik  and  Mitrofanova  (1965),  Pfanzagl  (1971, 
1973a),  and  Michel  and  Pfanzagl  (1971),  reference  should  be  made  to  Bickel 
(1974). 

Skovgaard  (1981b)  has  discussed  the  method  of  computing  the  Edgeworth 
expansions  of  the  distributions  of  m.f.  estimators  in  the  non  i.i.d.  case. 

He  has  also  given  the  first  four  terms  of  the  corresponding  stochastic  expan¬ 
sion. 

Edgeworth  expansions  of  distributions  of  Fisher-consistent  estimators 
for  curved  exponential  family  of  parent  distributions  (assumed  to  be  dominated 
by  the  Lebesgue  measure)  are  obtained  by  Ghosh,  Sinha  and  Subrahmanyam  (1979). 
This  result  leads  to  second-order  efficiency  of  the  m.t.  estimator  w.r.t.  any 
bounded,  bowl  shaped  loss  function.  The  formal  Edgeworth  expansions  here  are 
not  valid  without  the  assumption  of  the  dominating  Lebesgue  measure. 

Another  type  of  results,  though  not  concerned  directly  with  the 
distributions  of  estimators,  is  the  rate  of  convergence  to  normality  of 
stopping  times  associated  with  sequential  estimation.  One  such  result  is 
given  by  Ghosh  (1980). 

6.  Edgeworth  and  Cornish-Fisher  Expansions  in  Selection  and  Ranking 
Procedures.  We  will  restrict  our  attention  here  to  the  so-called  subset 
selection  procedures.  For  an  overall  view  of  the  theory,  we  refer  to 
Gupta  and  Panchapakesan  (1979)  and  Gupta  and  Huang  (1981).  A  typical  problem 
associated  with  many  classical  subset  selection  rules  is  the  evaluation  of 
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the  constant  d,  d  ■  0,  such  that  P[  max  (X.-Xn)  ■  d]  =  P*  where  P*  is 

lik  1  u 

specified  and  Xg,  X^,...,X^  are  i.i.d.  random  variables  with  the  real 
line  as  the  support,  or  the  evaluation  of  the  constant  c,  c  >  1,  such  that 
P[  max  ( X / XQ )  <  c]  =  P*  where  XQ,  X^,...,Xk  are  i.i.d.  random  variables 

with  the  positive  real  axis  as  the  support.  We  will  now  discuss  one  such 
problem. 

Let  Xg,  Xj,...,X  be  i.i.d.  normal  random  variables  with  mean  u  and 

2  ?  2 

variance  u  .  Let  s  be  an  estimator  of  o  which  is  independent  of  the  X^ 

2  2 

such  that  vs  /o  has  a  chi-square  distribution  with  v  degrees  of  freedom. 
Define  Y  -  (X[p]'Xq)/sv»  where  X^j  =  max(X^ , . . .  ,Xp) .  The  statistic  Y 
arises  in  the  problem  of  selecting  a  subset  of  normal  populations  (with  a 
common  unknown  variance  o  )  which  contains  the  "best"  population  with 
probability  at  least  P*.  This  problem  was  studied  by  Gupta  (1956).  For 
implementation  of  his  procedure,  we  need  to  evaluate  appropriate  percentage 
points  of  the  distribution  of  Y.  This  statistic  also  appears  in  a  few 
other  selection  problems  [see  Gupta  and  Sobel  (1957)].  The  probability 
P[Y  y]  can  be  evaluated  by  using  an  Edgeworth  expansion  of  standardized 
Y.  For  evaluating  the  percentage  points  of  Y,  we  can  use  Cornish-Fisher 
expansions.  We  will  illustrate  this  method  of  evaluation  and  discuss  some 
numerical  results. 


For  r  •  v,  we  can  write  the  rth  moment  of  Y  in  the  form 

[Hi 

S  Xr  -i  -  X  l2j 

“r  ■  E[(  -[fJ— >rJ  ’  A.r  lo  (J,)  ^  aP,r-2j  ’ 


where  [r/2]  is  the  integral  part  of  r/2,  a  .  is  the  ith  moment  of  the 

P  »  * 

largest  of  p  independent  standard  normal  random  variables,  and 
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r(^) 
(f)e/2  r(J) 


provided  that  $  >  -v. 

2 

Let  a  be  the  variance  of  Y.  Let  y(P*)  and  ys(P*)  denote  the  100  P* 
percentage  points  of  the  distributions  of  Y  and  Y$  =  (Y-yp/o,  respectively. 
Let  =  <r/ar ,  where  is  rth  cumulant  of  Y.  The  Cornish-Fisher  expansion 
for  ys ( P* )  is  given  by 


(6.3)  ys(P*)  =  z(P*)  +  [a3I  ]  +  [a4Id  +  a2I  2]  +  [«5I  +  «3«4Icd  +  ^  3]  + 

c  c 

Wf  +  a3“5Ice  +  a4Xd2+  a3a4I(.2d  +  “3^4  3  + 

2  2 

t^g  +  a3a6Icf  +  a4a5Ide  +  a3a5I  2  +  a3a4!  .2  + 

c  e  cd 

3  5 

a3a4 1  3  ,  +  a3!  5]  +  C<*8!h  +  a3a7Icg  + 
c  d  c  3 

a4a6Idf  +  a5*2  +  a3a6l  2f  +  a3a4u5Icde  + 
e  c  f 

3t  ,  3  t  ,22.  ,  4  .  .  5,  ■) 

a4!  .3  +  a3w5I  3  +  a3a4  1  2^2  +  a3u4  4.  +  a3J  5J  +  "-’ 
d  ce  cd  cd  c 


where  z(P*)  is  the  standard  normal  deviate  corresponding  to  P*  and 

Ic.  Id>  I  2”--  are  tabulated  in  Table  II  of  Fisher  and  Cornish  (1960)  for 
c 

P*  =  .5,  .75,  .90,  .975,  .99,  .995,  .9975,  .999  and  .9995.  Now,  y(P*)  = 

yi  +  oy  (P*).  For  calculating  u’  using  (6.1),  the  values  of  a„  ■  are 
IS  r  p,  1 

available  in  Ruben  (1954)  for  i  =  1(1)10  and  p  =  1(1)50. 

Gupta  and  Sobel  (1957)  tabulated  the  y(P*)-values  correct  two  decimal 

places  using  four  adjustment  terms.  Their  table  ranges  over  the  following 
values:  P*  =  .75,  .90,  .95,  .975,  .99;  p=  1,4,9(1)15(2)19(5)39,49; 

v  =  15(1 ) 20, 24, 30, 36, 40, 48, 60, 80, 100, 1 20,360,“.  The  y(P*)-values  can  also 
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be  obtained  from  the  tables  of  Krishnaiah  and  Armitage  (1966)  for  P*  =  .95, 
.99;  p  =  1(1)10  and  v  =  5(1)35.  These  latter  tables  were  computed  using 
Gauss-Hermi te  and  Gauss-Laguerre  quadrature  formulas.  The  two  tables  agree 
well  in  all  the  common  cases. 

Another  method  of  evaluating  the  percentage  points  of  Y  is  based  on 
the  result  of  Hartley  (1943-46),  who  obtained  the  probability  P[Y  <_  y]  as 
a  solution  of  certain  difference-differential  equation.  This  result  can 
be  stated  in  the  form 

(6.4)  p[y  ^  y]  =  I0(y)  +  ^  [y2f  I2(y)-I0(y) }  -  yi^y)] 

+  y4( 3i Q(y ) -6 1 2 (y )  +  i4(y) > 

1 6\i 

-  jy3U3(y)-3i1(y)} 

-  ^y2{i2(y)-i0(y)l 

+  \  yi^y)]. 

where 

, .  ikzi! 

(6.5)  Ii(y)=—  /e  2  (t-y)J  tp(t)dt. 

It  should  be  noted  that  Iq(/2H)  is  the  probability  that  the  maximum  of  p 
equally  correlated  standard  normal  variates  with  the  correlation  equal  to 
0.5  does  not  exceed  H.  This  probability  for  several  values  of  H  and  the 
values  of  H  corresponding  to  several  probability  levels  are  tabulated  by 
Gupta,  Nagel  and  Panchapakesan  (1973). 

In  Table  1  below  we  give  the  values  of  y(P*)  obtained  in  different 
ways  for  P*  =  .90,  .95;  p  =  1,  4,  9  and  v  =  17,  18,  20,  60.  For  each 
combination  of  the  values  of  P*,  p  and  v,  the  first  three  entries 


16 


correspond  to  the  values  obtained  by  taking  four,  six  and  eight  adjustment 
terms,  respectively,  in  the  Cornish-Fisher  expansion.  The  last  entry  is 
obtained  by  evaluating  P[Y  <_  y]  in  (6.4)  using  Gauss-Hermi te  quadrature 
formula  and  bisection  method.  All  these  computations  except  those  based 
on  Cornish-Fisher  expansion  with  four  adjustment  terms  are  new  with  this 
paper. 


TABLE  1 

VALUES  OF  y(P*)  OBTAINED  BY  CORNISH-FISHER  EXPANSION  WITH  4,  6, 
AND  8  ADJUSTMENT  TERMS  (LINES  1 ,  2  AND  3)  AND  HARTLEY'S  EXPANSION 
WITH  TWO  ADJUSTMENT  TERMS  (LINE  4) 

P*  =  .90 


p  1  4  9 


1.89 

2.75 

3.18 

17 

1.88824 

2.75879 

3.19306 

1.86438 

2.53829 

2.62070 

1.88570 

2.74894 

3.18010 

1.88 

2.74 

3.17 

18 

1 .88323 

2.74729 

3.17778 

1.86846 

2.59753 

2.78203 

1 .88147 

2.74023 

3.16854 

1.87 

2.72 

3.15 

20 

1.87525 

2.72949 

3.15419 

1.86881 

2.65339 

2.95164 

1.87433 

2.72557 

3.14908 

1.83 

2.64 

3.04 

60 

1.83257 

2.64050 

3.03672 

1.83256 

2.64018 

3.03602 

1 .83257 

2.64048 

3.03670 

ii‘  I'iiMtii  I  -■ 
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TABLE  1  (cont.) 

P*  =  .95 

\  l 

v 

1  1 

4 

9 

2.46 

3.29 

3.72 

17 

2.47450 

3.32350 

3.76197 

2.41414 

3.30553 

3.90291 

2.46047 

3.29995 

3.73373 

2.45 

3.28 

3.70 

18 

2.46248 

3.30146 

3.73310 

2.42389 

3.30025 

3.84884 

2.45258 

3.28516 

3.71421 

2.44 

3.25 

3.67 

20 

2.44463 

3.26891 

3.69093 

2.42682 

3.27674 

3.76500 

2.43929 

3.26042 

3.68176 

2.36 

3.12 

3.50 

60 

2.36268 

3.12065 

3.50159 

2.36264 

3.12092 

3.50225 

2.36267 

3.12065 

3.50162 

We  can  see  from  Table  1 

some  general  indications 

about  the  effect  of 

increasing 

the  number  of  adjustment  terms.  It  appears  that 

(1) 

the  approximations 

based  on  four  adjustment 

terms  are  generally 

quite  good 

,  certainly  to  two 

decimal  places; 

(2) 

adding  additional  terms  ma.y  not  produce  desirable  results  for  small 

values  of 

v,  large  values  of 

p,  and  P*  values  closer 

to  1 ; 

(3) 

as  v  gets  larger  than  20,  the  terms  in  the  < 

expansion  stabilize 

faster  and  more  so  for  small  p;  and 

(4)  the  values  obtained  by  using  Hartley's  result  with  two  adjustment 


terms  seem  to  be  adequate. 
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7.  Concluding  Remarks.  In  the  previous  sections,  we  discussed  asymptotic 
expansions  for  the  distribution  functions  of  random  variables  which  have  a  limit¬ 
ing  normal  distribution.  However,  there  are  test  statistics  of  practical  im¬ 
portance  which  have  nonnormal  limiting  distributions,  for  example,  the  statis¬ 
tics  that  arise  in  likelihood  ratio,  goodness  of  fit,  Kolmogorov-Smirnov, 
Cramer-von  Mises  and  Pearson  chi-square  tests.  We  will  briefly  refer  to  ex¬ 
pansions  in  some  of  these  cases. 

Let  X  have  a  continuous  d.f.  F{x)  and  let  Fp(x)  denote  the  empirical  dis¬ 
tribution  corresponding  to  n  independent  observations  on  X.  Define  = 

sup  /n  ( (x)-F(x) )  and  Dn  =  sup  /n  |  F  ( x)-F(x) ] .  Smirov  (1944)  studied 

X*-'*’  _co<X<n> 

the  distribution  of  D*  and  showed  that,  for  0  <  x  <  o(n^), 

2 

(4.1)  <(x)  =  P[Dp<x]  =  1  -  e'2x  [1  +  ^0(1)]. 

This  result  was  improved  by  Li-Tsian  Chan  (1955)  who  added  two  more  terms  for 
the  same  range  of  x;  he  showed  that 

(4.2)  >p(x)  =  1  -  6'2X2[1+  3^+^r(1-^)  + 

-$n  + 

These  results  are  discussed  in  Gnedenko,  Koroluk  and  Skorokhod  (1961).  Later, 
Lauwerier  (1963)  gave  the  following  expansion  for  <fp(x): 


♦:(*)  =  i  - 


nVn/2^ 


e'2x  l  fAH)^'2  , 


where  the  symbolic  expression  f ^ ( H )  stands  for  a  polynomial  fj(t)  in  which  the 

powers  tm  are  replaced  by  the  Hermite  polynomials  Hm(2x).  The  polynomials  f. 

are  determined  by  certain  generating  series.  Li-Tsian  Chun  (1956)  obtained 

_2 

also  an  expansion  for  the  d.f.  of  Dp  with  a  remainder  0(n  ).  Another 
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reference  in  this  connection  is  Borovkov  (1970),  who  treated  these  as  special 
cases  of  more  general  problems  of  first  passage  times  of  random  walks. 

tor  the  likelihood  ratio  criterion,  Hayakawa  (1977)  gave  a  formal  expan¬ 
sion.  This  was  later  justified  by  the  results  of  Chandra  and  Ghosh  (1980), 
who  gave  valid  expansions  for  the  likelihood  ratio  statistic  as  well  as  Wald's 
and  Rao's  statistics  under  contiguous  al ternati ves .  The  general  results  of 
Chandra  and  Ghosh  are  obtained  under  a  set  of  conditions  in  an  earlier  paper 
of  theirs  (1979)  besides  Cramer's  condition  and  smoothness  conditions  on 
moments . 

finally,  an  excellent  exposition  of  Edgeworth  and  saddle-point  approxi¬ 
mations  for  the  densities  of  sums  of  independent  random  vectors  has  been 
given  by  Barndorff-Nielson  and  Cox  (1979). 
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